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1. Introduction 



Let us introduce n pairs of random variables (Xi,Yi)i=i t ... tn that we suppose drawn from the pair 
(X,Y), valued in T x M, where J 7 is a semi- metric space. Let d denotes the semi-metric. Assume 
that there exists a regular version of the conditional probability of Y given X, which is absolutely 
continuous with respect to Lebesgue measure on IR and has bounded density. Assume that for a 
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given x there is some compact subset S := (a x ,(3 x ), such that the conditional density of Y given 
X = x has an unique mode 6(x) on S. 

In the remainder of the paper x is fixed in T and N x denotes a neighborhood of x. Let f x (resp. 
f x ^) be the conditional density (resp. the j th order derivative of the conditional density) of the 
variable Y given X = x. We define the kernel estimator f x of f x as follows: 

J \V) - 7777 -i TTTn . Vy G IK 

with the convention — = 0. The functions K and H are kernels and hx = Hk,u (resp. fin = hn.n) is a 
sequence of positive real numbers. Note that a similar estimate was already introduced in the special 
case where X is a real random variable by many authors, Rosenblatt (1969) and Youndje (1996) 
among others. For the functional case, see Ferraty et at (2006a). A natural and usual estimator of 
9{x) denoted 6(x), is given by: 

6{x) = argsup/ x (t/). (1) 

yes 

Note that this estimate 0(x) is not necessarily unique, so, the remainder of the paper concerns any 
value 9(x) satisfying |T]). 

The main goal of this paper is to study the nonparametric estimate 0(x) oi6(x) when the explanatory 
variable X is valued in the space T of eventually infinite dimension and when the observations 
(Y l ,X t ) i£ j^ are strongly mixing. 

The motivation for this mode regression model is its interest in some nonparametric estimation 
problems where the mode regression provides better estimations than the classical mean regression 
(see for instance Collomb et al. 1987, Quintela & Vieu (1997), Ould-Sad (1997), Berlinet et al. 
(1998), or Louani & Ould-Sad (1999), for the multivariate case). 

Currently, the progress of informatics tools permits the recovery of increasingly bulky data. These 
large data sets are available essentially by real time monitoring, and computers can manage such 
databases. The object of statistical study can then be curves (consecutive discrete recordings are 
aggregated and viewed as sampled values of a random curve) not numbers or vectors. Functional data 
analysis (FDA) (see Bosq 2000, Feratty and Vieu, 2006, Ramsay and Silverman, 1997, 2002) can help 
to analyze such high-dimensional data sets. The statistical problems involved in the modelization 
of functional random variables has received increasing interests in recent literature (see for example 
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Dabo-Niang 2002, Ferraty & Vieu 2004, Dabo-Niang & Rhomari 2004, Masry 2005 for nonparametric 
context). In this functional area, the first results concerning the conditional mode estimation were 
obtained by Ferraty et al. 2006a. They established the almost complete convergence of the kernel 
estimator in the i.i.d. case. This last result has been extended to dependent case by Ferraty et al. 
2005. Ezzahrioui and Ould-said (2006a, 2006b) have studied the asymptotic normality of the kernel 
estimator of the conditional mode for both i.i.d. and strong mixing cases. The monograph of Ferraty 
& Vieu 2006b presents an important collection of statistical tools for nonparametric prediction of 
functional variables. Recently, Dabo-Niang & Laksaci (2007) stated the convergence in L p norm of 
the conditional mode function in the independent case. 

In this paper, we consider the case where the data are both dependent and of functional nature. We 
prove the ^-integrated consistency by giving the upper bounds for the estimation error. We show 
how our results can be applied to prediction of functional times series, by cutting the past of the 
time series in continuous paths. As an application, we applied our method to some environmental 
data. 

The paper is organized as follows: the following Section is devoted to fixing notations and hypotheses. 
We state our results on Section 3. Section 4 is devoted to an application to a time series prediction 
problem. 

2. Notation and Assumptions 

We begin by recalling the definition of the strong mixing property. For this we introduce the follow- 
ing notations. Let T^{Z) denote the a— algebra generated by {Zj, i < j < k} . 

Definition 1 Let {Zi,i = 1,2, ...} denote a sequence of rv's. Given a positive integer n, set 
a(n) = sup{|P(AnB) -P(A)P(B)| : A e T\{Z) and B E F^+JZ), fee IN*}. 
The sequence is said to be a-mixing (strong mixing) if the mixing coefficient a(n) — > as n — > oo. 

There exist many processes fulfilling the strong mixing property. We quote, here, the usual ARMA 
processes which are geometrically strongly mixing, i.e., there exist p G (0, 1) and a > such that, 
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for any n > 1, a(n) < ap n (see, e.g., Jones (1978)). The threshold models, the EXPAR models (see, 
Ozaki (1979)), the simple ARCH models (see Engle (1982)), their GARCH extension (see Bollerslev 
(1986)) and the bilinear Markovian models are geometrically strongly mixing under some general 
ergodicity conditions. 



Throughout the paper, when no confusion is possible, we will denote by C or C some strictly positive 
generic constants, and we will use the notation B(x, h) = {x' G T : d(x' , x) < h}. Our nonparametric 
model will be quite general in the sense that we will just need the following assumptions: 

(HI) P(X eB(x,r)) =Mr) > 0. 

(H2) (Xi, Yi) ie j^ is an a-mixing sequence whose coefficients satisfy 



3a > 0, 3c> : Vn e IN a(n) < cn~ a . 

(Mh)) (a+1)/a 



(H3) Vz j, 

< supP[(Xi,Xj) e B(x,h) x B(x,h)] = O 
(H4) V( yi ,y 2 ) ESxS, V(x u x 2 ) E N x x N x , 

\f xi (yi) ~ f X2 (V2)\ < C {d( Xl ,x^ + \ Vl - y 2 \ b * 



n 



l/a 



h >0,b 2 > 0. 



(H5) f x is j-times continuously differentiable with respect yon S such that, f x ^ l \6{x)) — 0, for 1 < 



I < j, and 



f x(i) (y) 



< oo, for all y e S. 
(H6) K is a function with support (0, 1) such that < C < K{t) < C < oo. 

(i) There exists an integrable function g such that 
\H(t)-H(s)\<Cg(\t-s\), 
(ii) J \t\ b2 H(t)dt < oo , and J H(t)dt = 1. 



(H7) H is a function which satisfies : < 



The concentration propriety (HI) is less restrictive that the fractal condition introduced by Gasser 
et al. (1998) and is known to hold for several continuous time processes (see for instance Bogachev 
(1999) for a gaussian measure, Li & Shao (2001) for a general gaussian process and Dabo-Niang 
& Laksaci (2007) for more discussion). In order to establish the same convergence rate as in the 
i.i.d. case (see Niang & Laksaci (2006) ), we reinforce the mixing by introducing (H2) and (H3). 
Note that we can establish the convergence results without these mixing assumptions, however, the 
convergence rate expression will be perturbed, it will contain the covariance term of the observations. 
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Assumptions (H4) and (H5) are regularity conditions which characterize the functional space of our 
model and are needed to evaluate the bias term in our asymptotic devlopments. It should be noted 
that, the flatness of the function f x around the mode 9{x) controlled by the number of vanishing 
derivatives at 9(x) ( assumption (H5)), has a great influence on the asymptotic rates of our estimate 
(see Theorem [1]). Assumptions (H6) and (H7) are standard technical conditions in nonparametric 
estimation. They are imposed for the sake of simplicity and brevity of the proofs. 

3. Main results 

In this Section, we establish the p-mean rate of convergence of the estimate 6{x) to 9(x). 



Theorem 1 Under the hypotheses (H1)-(H7), we have for all p £ [j, oo[ 

i/p / £i / / 1 



o 



n(j) x {h K ) 



whenever 



377 > 0, CtiS^ + " < cf> x (h K ) < Cn& 
holds with a > max (p + 1, + p + ^/(4 + p) 2 - 4 - 8pj /2j . 



(2) 



Proof. Let us now write the following Taylor expansion of the function f x under (H5) 

r(9(x)) = t(9{x)) + If «(<r)(0(z) - 9(x)y, 
for some 9* between 9{x) and 9{x). By simple analytic arguments, we can show that 



\9{x)-9(x)\ j < 



J- 



min f x(j) (y)y<=( a *A) 

2/e(a x ,/5x) 



sup \f x (y)-f x (y)\. 



The Minkowski inequality permits us to write: 

f x (y)-f x (y) 



sup 

j/G(c»x,/3x) 



< 



Y, W m (x) sup \Hi(y) - E {Hi(y)/Xi)\ 

j/e(a x ,/3x) 



i=l 



Y, Wm(x) sup \E (Hi{y)\Xi) - f x (y)\ 



i=l 



1/p 



sup \r(y)\{P{YWni(x) = 

V^{a^,l3 x ) \ \, = 1 ; 



Vp > j, 



imsart-ejs ver. 2008/01/24 file: ejs_2008_347.tex date: December 30, 2008 



Laksaci and Dabo-Niang/ 'Conditional mode regression 



6 



where 



W ni {x) 



K{h^d{x,Xi)) 



and H i (y) = h] I 1 H(h- H L (y-Y i )) 



E?=i^(^M)) 
Then, Theorem Q] is a consequence of the following lemmas. 

Lemma 1 We get under the hypotheses (HI), (H4), (H6) and (H7(ii)): 

n 

J2W ni (x) snp \E(Hi(y)\ X t ) - f x (y)\ = 0(h b *) + O(h^). 
i=i yeJR p 

Lemma 2 Under the hypotheses (H1)-(H3), (H6) and (H7(i)), we have for all p £ [j,oo[ 

i 

1 



Vw ni (x) sup \Hi(y) - E (Hi{y)\ X, 



O 



n(f> x {h K ) 



whenever 

3rj > 0, Cn^T^ +71 < <j) x (h K ) < C'n^ 
holds with a > max (p + 1, (a + p + ^(A+p) 2 - 4 - 8pj /2^j . 

Lemma 3 Under the conditions of Lemma\^ we have: 



P[J2W ni (x) = 



i/p 



n4> x {hi() 



(3) 



4. Application 



The most important application of conditional mode estimation when the observations are dependent 
and of functional nature is the prediction of future values of some process by taking into account 
the whole past continuously. Indeed, let (^t)tg[o,b[ be a continuous time real valued random process. 
From Z t we may construct N functional random variables (-^j)j=i,...,jv defined by: 

Vf 6 [0, b[, Xi[t) = Zjv-i^t-i^+t), 

and a real characteristic Yi = G(Xi + i). The above consistency result permits to predict the char- 
acteristic Ym by the conditional mode estimate Y = 6(Xn) given by using the N — 1 pairs of r.v 
(Xi, li)i=i,... J jv-i- 
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5. Appendix 



Proof of lemma [T] : By definition of the L p norm, we have 



= E 1/p 



]T W m (x) sup \E (Hiiy^Xi) - f (y)| 



J^WniWlB^^Xi) sup ^{Hi^Xi)- f x {y)\ 

y-z 



where 1 is the indicator function. If we consider the change of variables t = , then we get under 

h H 

(H7 (ii)) and (H4) 

\E(H t (y)\X t )-r(y)\< f \H{t) [f x *(y - th H ) - f \y)\\ dt, 



(\th H \ b2 + \h K \ b i)H(t)dt 



l B fr M {Xi) \(E (Hi(y)\Xi) - f x (y))\ < C 
We deduce from Y^i=i W n i(x) = 1, that: 



y^Wniix) sup |^(ffi(l/)|^i)-/ a (»)l 

i=1 ae(Q x ,/3x) 

The proves the lemma. 



< C 



{\th H \ b2 + \h K \ bl )H(t)dt 



Proof of lemma [2] 



It is easy to see that 



W ni (x) sup \Hi(y) - E (H t (y)\X t 
i=i yelR 



< CE 1/p 



p/2 / n 

su V W nj ) ( K( 2 (x) sup \Hi(y) - E (Hiiymi 

aeIR 



p-i 



1 ' \j=i 

By (H7 (i)), the definition of Eli and the boundedness of the conditional density with respect to the 
two variables, we have for all i: 

\H i {y)-E(H i (y)\X i )\=h- H 1 H \h H l (y - Y$) - J H{h H \y - z))f x '(z)dz 

H{h H 1 {y-Y l ))-H{h H \y-z))\f x '{z)dz <C J g (\h-\Y, - z)\) dz 
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It follows from the usual change of variables hJj 1 (Y i — z) = t that 

iHM-Emy^KC J g(\t\)dt. 
Moreover, by Cauchy-Schwartz inequality, we can write 

n 

,1/2 



i=i 



So, we deduce that 



VVniO*) SU P \Hi(y)-E(Hi(y)\Xi)\ 

ye(a x ,/3 x ) 



i=i 



p 

p/2 



g(\t\)dt 



\ P/2 



/ \ P/^ 

We first evaluate the quantity £7 ' p I sup W nj - . For this, set U = K{h~^d{X n , x)), u = E(U) 
and V — Y^=i K{h~^d{Xj,x)), it is clear that 

C"Mh K ) <u<C<j> x {h K ) 

and E(V) = (n — l)u. If we consider the random variable Z n _i = min(l, C/V), then, for all j 

W nj {x) = K{H£d{X h x))/{U + V)< Z„_i. 

We have for all c > 0; 

Z P J_\ = Z p J_\l v<c + Z p J_\l v > c < l v<c + (Clcfl\ 
For c = (n — l)u/2, we get: 



E(Z p J_\)<P(V<(n-l)u/2) + 



C 



(n-l)i 



p/2 



Clearly, 



P(V < (n- l)u/2) = P(V-£(V) <-(n-l)u/2) 

; (n- l)u/2) 
: /' I ^r^|V-£7(V)| > 1/2 



(n - l)^i 1 



By using the Fuk-Nagaev's inequality (Rio 1999) we can show that, for all A > and r > 1 

1 



'" l Tmr \V-E{V)\>A\\<A 1 +A 2 
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where 



Ai = H — y / ; and A 2 = 4 1 



A 2 < C*(n - l)r Q (n - l)-( Q+1 )/ 2 <^(/^)~ (a+1)/2 (>g(™ - l))-^ 1 )/ 2 

By taking r = 0((log (n — l)) 2 ) and by using the left part of inequality (J3j> , we can find rl > such 
that 

A 2 < Cn-'i'n-P/ 2 cf> x (h K )-P/ 2 . (4) 

For A 2 , we must evaluate asymptotically the quantity 

n-i 

yar(Z) = ^ Cov(Ki,Kj) := s 2 * +nVar(K 1 ) 

where 

ra-1 

In the sequel, we use techniques developed by Masry (1986) to give the asymptotic behavior of s 2 *. 
Define the sets 

S\ = such that 1 < i — j < m n } 

and 

52 = {(i, j) such that m n + I < i — j < n — 1} 

where the sequence m„ is chosen such that m„ — > 00. We denote by J\ n and J2 iTt be the sum of the 
covariance over Si and S2 respectively. Then, 

Ji, n = \ Cov ( K ^ K j)\ ^ \ EK i R i - EKiEKj\. 

Si Si 

Because of (HI), (H3) and (H6) we can write 
Jl,n < Cnm n 4>x 



On the other hand, to study the sum over S2, we use the Davydov-Rio's inequality (see Rio, (1999)) 
in the L°° cases. This leads, for all i 7^ j to 

ICoviK^K^KCadi-jl) 
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and therefore we get 

Ji,n = ICoviKuKj)] < Cn 2 m- a . 
s 2 

By choosing m n — ^ K ) , using the right part of ([3]), we obtain 
V n J 

n-l 

J2Cov(K i ,K j ) = 0(n<f> x (h K )). (5) 

Now, for alii = 1 , ... n — 1 we can write 

Var{K x ) = E(K 2 ) - {EK^f . 

By using (HI) we get 

Var^) < C(<f> x (h K ) + {<i> x {h K )) 2 ). 
Finally, this last result combined with ([5]) leads directly to 

Var(Z) = 0((n-l)cj )x (h K )). (6) 

This allows us to deduce that 

A2log(n-l)V r/2 , 01 A , Aglog(n-l) 



Choosing r = C(log(n — l)) 2 , yields that 

A 1 <Cexp(-A^)=^/3 2 . 

As (p < oo ), for Ao large enough, 

A x < Cn~ x o/ 32 < Cn- l 'n- p / 2 (l) x (hKy p/2 for some v > 0. (7) 
So, by combining the results (g]) and {7} we get, for A large enough, 

P(V<(n-l)«m = («) 

which implies that 

/ / 1 \ W 

SVP^) = O 



1/2N 



ruf) x {hK) 

So, 

n 

sup |fli(y) -^(^(yJlXOI = O 
i—l yeH 



n<f> x (h 



P 
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Proof of lemma [3] 

We set Z = -j^- ^ K l , then, W ni {x) =Qj\= (P(Z = 0)) . It is clear that, for all e < 1, 

we have 

(P(Z = 0)) <(P(Z<l-e)) 

<(P(\Z-l\>e)). 

By following similar arguments to those involved in the proof of ([5]), we have that: 

p/2\ 



P(\Z-l\>e)=o 

This implies that, 



1 



n4> x {h K ) 



(VP) / , x 1/2N 

n<p x (h K ) 



Acknowledgements : The authors thank Professors F. Ferraty and Ph. Vieu of the LSP, University 
of Paul Sabatier, and N. Rhomari of University of Oujda, for helpful discussions and comments. 

References 

Bogachev, V.I. (1999), Gaussian measures. Math surveys and monographs, 62, Amer. Math. Soc. 

Bollerslev, T., (1986), General autoregressive conditional heteroskedasticity. J. Econom., 31, 307- 
327. 

Bosq, D. (2000) Linear Processes in Function Spaces: Theory and applications. Lecture Notes in 
Statistics, 149, Springer. 

Collomb, G., Hardle, W. and Hassani, S. (1987), A note on prediction via conditional mode estima- 
tion. J. Statist. Plann. and Inf., 15, 227-236. 

Dabo-Niang, S. and Rhomari, N., (2003), Estimation non paramtrique de la rgression avec variable 
explicative dans un espace mtrique. C. R., Math., Acad. Sci. Paris 336, No.l, 75-80. 



imsart-ejs ver. 2008/01/24 file: ejs_2008_347.tex date: December 30, 2008 



Laksaci and Dabo-Niang/ 'Conditional mode regression 12 

Dabo-Niang, S. and Rhomari, N., (2002), Nonparametric regression estimation when the regres- 
sor takes values in a metric space, Technical Repport 2002-9, LSTA Univ. paris 6, 2001, http : 
/ /www. ccr.jussieu.fr/lsta/ R2002g.pdf. 

Dabo-Niang, S. and Laksaci, A., (2007), Estimation non paramtrique du mode conditionnel pour 
variable explicative fonctionnelle, C. R., Math., Acad. Sci. Paris, 344, 49-52. 

Dabo-Niang, S. and Laksaci, A., (2006), Nonparametric estimation of conditional quantiles when the 
regressor is valued in a semi-metric space, submitted. 

Englc, R.F., (1982), Autoregressive conditional heteroskedasticity with estimates of the variance of 
U.K. inflation. Econometrica, 50, 987-1007. 

Ezzahrioui, M., Ould-Said, E. (2005), Asymptotic normality of nonparametric estimators of the 
conditional mode function for functional data. LMPA, Aot 2005 (Preprint). 

Ezzahrioui, M., Ould-Said, E. (2006b), On the asymptotic properties of a nonparametric estimator 
of the conditional mode function for functional dependent data. LMPA, janvier 2006 (Preprint). 

Ferraty, F., Vieu, Ph. ,(2004), Nonparametric models for functional data, with application in regres- 
sion, time-series prediction and curve discrimination. J. Nonparametric Stat. 16, No. 1-2, 111-125. 

Ferraty, F., Laksaci, A. and Vieu, P. (2005), Functional times series prediction via conditional mode. 
C. R., Math., Acad. Set. Pans 340, No.5, 389-392 

Ferraty, F., Laksaci, A. and Vieu, P. (2006a), Estimating some characteristics of the conditional 
distribution in nonparametric functional models. Statist. Inf. for Stoch. Proc. 9, No. 2, 47-76. 

Ferraty, F. and Vieu, Ph., (2006b), Nonparametric functional data analysis. Springer- Verlag, New- 
York. 

Jones, DA. (1978), Nonlinear autoregressive processes. Proc. Roy. Soc. London A 360, 71-95. 

Li, W.V. and Shao, Q.M. (2001), Gaussian processes: inequalities, small ball probabilities and ap- 
plications. In Stochastic processes: Theory and Methods, Ed. C.R. Rao and D. Shanbhag. Hanbook 
of Statistics, 19, North-Holland, Amsterdam. 

Louani, D. and Ould-Said, E., (1999), Asymptotic normality of kernel estimators of the conditional 



imsart-ejs ver. 2008/01/24 file: ejs_2008_347.tex date: December 30, 2008 



Laksaci and Dabo-Niang/ 'Conditional mode regression 

mode under strong mixing hypothesis. J. Non-parametric Stat., 11 (4), 413-442. 



13 



Masry, E. (1986) Recursive probability density estimation for weakly dependent stationary processes. 
IEEE, Trans. Inform. Theory, 32, 254-267. 

Masry, E. (2005), Nonparametric regression estimation for dependen functional data: asymptotic 
normality. Stochastic Process. Appl. 115, No.l, 155-177. 

Ozaki, T., (1979), Nonlinear time series models for nonlinear random vibrations. Technical report. 
Univ. of Manchester. 

Ramsay, J. O and Silverman, B.W. (1997) Functional data analysis. Springer, New York. 

Ramsay, J.O. and Silverman, B.W. (2002). Applied functional data analysis; Methods and case stud- 
ies. Springer- Verlag, New York. 

Rio, E. (1999). Theorie asymptotique des processus faiblement dependants. Mathematiques & Ap- 
plications, 31, Springer-SMAI. 

Ould-Sad, E., (1997), A note on ergodic processes prediction via estimation of the conditional mode 
function. Scand. J. Stat. 24, No.2, 231-239. 

Quintcla del Rio, A. and Vieu, Ph., (1997), A nonparametric conditionnal mode estimate. Nonpara- 
metric Statistics, 8, 253-266. 

Rosenblatt, M., (1969), Conditional probability density and regression estimators. In Multivariate 
Analysis II, Ed. P.R. Krishnaiah. Academic Press, New York and London. 

Youndj, E., (1993), Estimation non paramtrique de la densit conditionnelle par la mthode du noyau. 
Thse de Doctarat, Universit de Rouen. 



imsart-ejs ver. 2008/01/24 file: ejs_2008_347.tex date: December 30, 2008 



